
STOP 



Early Journal Content on JSTOR, Free to Anyone in the World 

This article is one of nearly 500,000 scholarly works digitized and made freely available to everyone in 
the world by JSTOR. 

Known as the Early Journal Content, this set of works include research articles, news, letters, and other 
writings published in more than 200 of the oldest leading academic journals. The works date from the 
mid-seventeenth to the early twentieth centuries. 

We encourage people to read and share the Early Journal Content openly and to tell others that this 
resource exists. People may post this content online or redistribute in any way for non-commercial 
purposes. 

Read more about Early Journal Content at http://about.jstor.org/participate-jstor/individuals/early- 
journal-content . 



JSTOR is a digital library of academic journals, books, and primary source objects. JSTOR helps people 
discover, use, and build upon a wide range of content through a powerful research and teaching 
platform, and preserves this content for future generations. JSTOR is part of ITHAKA, a not-for-profit 
organization that also includes Ithaka S+R and Portico. For more information about JSTOR, please 
contact support@jstor.org. 



SOLUTION OF THE MATRIX EQUATION X~ l AX = N. 
By H. T. Burgess. 

If A is any square matrix, it is a well-known theorem in the matrix 
theory that there exists a similar matrix N which is of a unique canonical 
form.* The form of N is determined by the elementary divisors of the 
characteristic matrix \I + A. The existence of the matrix X is known, 
but no simple method of computing it is to be found in the literature. 
It is the object of this paper to give a simple and direct method. 

In the Annals of Mathematics, Vol. XVIII, No. 1, September, 1916, 
p. 4, is found a brief paper to which reference will be made as the former 
paper. 

1. Notation. Consider the linear substitution, 



Vi = bnXi + b u x 2 + 
2/2 = &21Z1 + O22X2 + 



+ b ln x n , 

~T~ 2n X n) 



Vn = b nl Xi + b n2 x 2 + ■ ■ • + b nn x n , 
and observe that it may be written in matrix notation in the form 



&11 b 12 
621 O22 



bm b n 



bin 




Xi 


bin 




x 2 


Unn 




Xn 



in which the ordinary rule of matrix multiplication applies. More briefly 
we may use y = Bx in which small letters denote matrices of n rows and 
one column, and the large letter denotes a square matrix of order n. If 
the matrix y is zero, we have a system of n linear homogeneous equations 
in n unknowns expressed in this notation by Bx = 0. 

It immediately appears from this notation that the system of linear 
homogeneous equations DBx = has each and every solution of Bx = 
as a solution. Further if the rank of the matrix DB is less than that of 
the matrix B, the system DBx = has solutions which are not solutions 
of the system Bx = 0. If k is any positive integer, every solution of 
Bx = is also a solution of B k x = 0, while any solution of B k x = which 

* For a demonstration of this theorem, see Hilton, Linear Substitutions, p. 29. 
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is not a solution of B k ~ x x = is not a solution of B"x = for a < k. This 
of course applies only when B is a singular matrix. 

2. Method of computation. By Theorem 2 in the former paper, we 
can compute the elementary divisors of the characteristic matrix AJ-f-A, 
and thus uniquely determine the form of the canonical matrix N for any 
given matrix A. The nature of the canonical form N is found in the 
" fundamental theorem " in the former paper and need not be restated 
here. 

From (a) X~*AX = N, we have (b) AX = XN. Any solution X 
of (a) is also a solution of (b) , and any non-singular solution of (6) is also 
a solution of (a) . If we can devise a scheme of finding all of the solutions 
of (b), we have only to select a non-singular solution of (6) to obtain a 
solution of (a). 

The necessary and sufficient conditions for X to be a solution of (b) 
are found by equating the corresponding elements by columns in the 
products AX and XN. Referring to the "fundamental theorem" in 
the former paper, we see that it is sufficient to consider only the t e columns 
corresponding to the typical component N tf in N. For this purpose, we 
write (b) in the form (a J + A)X = X(aJ + N). It is further obvious 
that the conditions for these t s columns can be found by considering only 
the e w columns corresponding to the typical component E, w in N tf . Using 
the notation previously explained, we denote the block of columns in X 
which corresponds to these e„ columns by 



x •'"' ) 



*li ( "> 



Xti 



(») 



(i = 1,2, ...,«,). 



Since in any column of the product {a J + A)X, only elements from 
the corresponding column of X appear, we get (aj + A)xi+i<-" ) in the 
column corresponding to the (i + l)st column of the component E etr . 
Likewise for i > 0, in the corresponding column of the product X(aJ-\-N), 
we get simply a;,- 00 ; for the only non-zero element in the (i + l)st column 
of a J + E e ^ is a 1 appearing in the over-principal diagonal. For 
i = 0, we get aV"' = 0, for the first column of aj + E, n contains only 
zeros. We can now express the t sets of equations to determine the r 
blocks of columns in X which correspond to the t e columns in the com- 
ponent N h as follows: 

*/-> = (aj + A)x i+1 <*\ i = 0, 1, 2, • • • e w , 

[C) *,<-> = 0, i = 0; Xi<"> +0, i 4= 0, ir = 1, 2, • • • r. 
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For convenience denote a,I + A by B, and consider the following 
set of matrix equations: 

Zo 00 = Bxi^ = BV° = BW V) = ■■■ = B'- 1 x K ^'> = B K x™ 

xS') = Bx^ = BW> = • • • = B'-^x K -i<'> = B K ~W> 

x^ = Bx z ™ = • • • = B K ~ s x K -i^ = B^x™ 
(d) 

x ,(-) = . . . = B'-tx^") = £"- V° 



a;,-! 00 = Bx« (,r) 

From this notation, it is obvious that this set of matrix equations is 
derivable from those equations which constitute the diagonal by direct 
matrix multiplication. If we choose x/" ) as a solution of B*x = and 
not a solution of B*~ l x = 0, we have Xo < -" ) = 0, and rci 00 , x 2 (,,) , • • ■ 
x K i*> 4= 0. To obtain a solution of the sets of equations (c), we take 
simultaneously 

7T = 1, 2, • • • r, 

« = ei, e 2 , • • • e T . 

This choice is evidently always possible, for the rank of B K is less than the 
rank of B"~ l for k Si e x by Theorem 2 of the former paper. Moreover 
the choice of x ic ( " ) is by no means unique, there being an infinity of ways of 
making the selection. 

Applying this same process to the remaining roots of the characteristic 
equation of A, we obtain the complete matrix X which is a solution of 
equation (b). 

Since all the solutions of (&) are to be obtained in this manner, to obtain 
a non-singular solution, it is only necessary to properly choose the matrices 
whose selection is by no means unique. 

In the application of this method, we can greatly simplify the actual 
computation of the elementary divisors and the matrix X if we observe 
first, that the reduction of any matrix B by elementary transformations 
on its rows only is equivalent to a matrix multiplication of the type TB 
where T is non-singular; secondly, that B and TB have the same rank; 
and thirdly, that the two systems of linear homogeneous equations Bx = 
and TBx = are equivalent. Consider the matrix equations 

(1) T\B = Ri where ri is the rank of B and Ri, 

(2) TsTiB 2 = T 2 RiB = R 2 where r 2 is the rank of B 2 and R 2 , 
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(k) T K ■■■ TiTiB* = T K R,-iB = R* where r K is the rank of B" and #„ 



Let T K be a non-singular matrix such that T K R K ^iB = R K where R K is 
reduced by elementary transformations on its rows only, then the ranks 
of B" and R K are the same and the two systems of linear homogeneous 
equations B*x = and R K x = are equivalent. Furthermore by the 
former paper, Theorem 2, we need to carry the work out only until we 
find n — r K = t e) where t, is the multiplicity of the factor X — a, in the 
characteristic equation of the matrix A. 
3. Illustration. Given the matrix 



A = 
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Its characteristic equation is found to be <p(\) = (X — 4)(X — 3) 2 (X — 2) 7 
= 0, and this shows that N is of the form 



N = 



Nr 



N 2 



N 7 



For the first column of X, we have to solve the matrix equation 

(4/ + A)x = 0. 

The solution found in the usual way is Xi = (0, 1, 0, 0, — 1, 0, 0, — 1, 0, 0). 
For the second and third columns of X, we must compute, from 
B = 3/ + A, 
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R 2 = 



10 
10 
10 
1 



10 0-1 



10 





0000-2 0010 
0000 01000 
0000 00100 
0000 00001 

By Theorem 2 of the former paper, we have 



iV 2 = 



- 3 1 

0-3 



and our equations to determine x 2 and x 3 are 

(1) = (37 + A)x 2 = (3/ + A) 2 x 3 , 



(2) 



x 2 = (3/ + A)x t . 



n = 9; 



To = 



We take x$ as a solution of R 2 x = and not a solution of RiX = 0. 
Such a solution is obviously x 3 = (0, 1, 1, 0, — 1, 0, 0, — 1, 0, — 1), and 
from (2) we compute x 2 = (1, 1, 0, 0, - 1, 0, 0, - 2, 0, 0). 

For the last seven columns of X, we compute from B = 21 + A 
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Ri 



Ri = 



10 


-22 1 1 12 








10 


32 4 9 -12 




10 



64 8 8 -24 
4 10 -1 10-4 


, n = 6; 





10 -66 -7 -12 26 







1 




81 


-47 132 41 64.5 262 -25 




972 


584 -744 -20 174 -2380 352 




486 


584 -744 -20 -312 -2380 352 


j 


0- 


-972 -410 276 530 249 2320 -94 






r 2 = 4; 


360 -233 


40 -472 932 170 426 2210 -276 




2 -1 


1 


; 


5177 - 


-3160 4888 -7028 1510 -2514 -17810 3444 










r 2 = 


3. 



i?3 = 



We have, from these ranks, 



N 7 = 



E 3 



E*> 



E, 



E, 



and the following equations to determine the last seven columns of X: 

(1) = (27 + A)x t = (27 + A)*x, = (27 + A) 3 z 6 , 

(2) x, = (27 + A) x s = (27 + A) 2 x 6 , 

(3) x s = (27 + A) xe, 

(4) = (27 + A)x 7 = (27 + A) 2 z 8 , 

(5) x 7 = (27 + A) a*, 

(6) = (27 + A)x 9 , 

(7) = (27 + A)x w . 

We must choose x e as a solution of Rsx = and not a solution of 
R 2 x = 0. Such a solution is x 6 = (390, 492, 102, 0, -492, 1, 0, -882, 
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-1, -102). Equations (3) and (2) give 

x b = (- 290, - 392, - 98, 0, 391, 1, 0, 686, - 1, 90), 

x 4 = (194, 306, 112, - 11, - 288, 16, - 32, - 500, 0, -88). 

Next we choose x 8 as a solution of R 2 x = and not a solution of 
Rix = 0. Such a solution is 

Xg = (_ 34, - 45, - 10, 0, 45, 0, - 1, 80, 0, 10), 

and equation (5) gives 

x 7 = (26, 42, 16, - 1, - 40, 2, - 4, - 68, 0, - 12). 

It remains to choose x$ and Xio, as solutions of R x x = 0, care being 
taken that they are independent of x 4 and x 7 . They may be taken as 

x, = (- 2, - 8, - 16, 5, 14, 0, 20, 0, 0, 0), 
in = (2, 18, 16, 5, - 24, 0, 0, - 20, 0, 0). 
This gives 



X = 
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